Effective lexical tree search for large vocabulary continuous speech recognition

نویسندگان

  • Hiroki Yamamoto
  • Toshiaki Fukada
  • Yasuhiro Komori
چکیده

In this paper, we present an e cient calculation of the factored LM probabilities for speeding up the large vocabulary continuous speech recognition. We introduced a novel technique based on the independent calculation of the factored LM probability. The basic idea of the proposed method is that each factored LM probability is calculated on-demand for a new combination of a previous word hypothesis and a LM look-ahead tree node, instead of calculating all the factored LM probabilities over the tree at a time. The speaker-independent continuous speech recognition experiment was performed for 20 speakers on a 60k word newspaper dictation task. As a result, the proposed method achieved 25% improvement in speed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Input Acoustic Analysis Phoneme Inventory Pronunciation Lexicon Language Model

This paper gives an overview of an architecture and search organization for large vocabulary, continuous speech recognition (LVCSR at RWTH). In the rst part of the paper, we describe the principle and architecture of a LVCSR system. In particular, the issues of modeling and search for phoneme based recognition are discussed. In the second part, we review the word conditioned lexical tree search...

متن کامل

Speech Input Acoustic Analysis Phoneme Inventory Pronunciation Lexicon

This paper gives an overview of an architecture and search organization for large vocabulary, continuous speech recognition (LVCSR at RWTH). In the rst part of the paper, we describe the principle and architecture of a LVCSR system. In particular, the issues of modeling and search for phoneme based recognition are discussed. In the second part, we review the word conditioned lexical tree search...

متن کامل

Segmental search for continuous speech recognition

The paper illustrates a search strategy for continuous speech recognition based on the recently developed Fast Segmental Viterbi Algorithm (FSVA) [5], a new search strategy particularly e ective for very large vocabulary word recognition. The FSVA search has been extended to deal with continuous speech using a network that merges a general lexical tree and a set of bigram subtrees generated on ...

متن کامل

An efficient lexical tree search for large vocabulary continuous speech recognition

This paper describes an efficient search algorithm for a high speed and high accuracy LVCSR system. A conventionally used lexical tree search is an efficient method, but has a problem in incorporating the language probability. To solve this problem, we propose in this paper a new efficient search algorithm incorporating the language model structure. In our developed LVCSR, 2-pass search algorit...

متن کامل

Improvements in search algorithm for large vocabulary continuous speech recognition

Current time-synchronous beam-search algorithm is improved from two aspects for speeding up large vocabulary continuous speech recognition. Single-triphone-tree structure is proposed to take instead of the tree copy technique for simplifying the search computation and saving the memory . By one kind of special-designed token propagation strategy, the n-gram language model can be integrated into...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000